Empirical mode decomposition-based spectral acoustic cues for disordered voices analysis
نویسندگان
چکیده
Cepstral-based acoustic cues for disordered voices analysis have been investigated in a number of studies. It has been shown that cepstral-based acoustic cues such as the harmonics-to-noise ratio (HNR), the amplitude of the first rhamonic (R1A) provide acoustic correlates for hoarse voice quality. The aim of this presentation is to investigate an acoustic analysis of speech by means of spectral acoustic cues obtained via empirical mode decomposition (EMD) of the log of the magnitude spectrum of the speech signal as an alternative to the cepstral-based acoustic cues. The spectral acoustic cues investigated in this article are the harmonics-tonoise ratio (HNR) and the amplitude of the first harmonic (H1A). The EMD-based spectral acoustic cues are evaluated on a corpus of synthetic stimuli generated by a synthesizer of disordered voices as well as on a corpus of natural sustained vowels comprising 251 normophonic and dysphonic speakers. The performances of the EMD-based spectral acoustic cues (HNREMD and H1A) are compared to those of cepstral-based acoustic cues (HNRceps and R1A). Experimental results show that the EMD-based spectral acoustic cues outperform the cepstral-based acoustic measures in terms of the correlation with the perceived degree of hoarseness defined as the global quality of the voice and provided by the grade (G) in the GRBAS scale.
منابع مشابه
Assessment of disordered voices based on an optimized glottal source model
In this paper, a method for the assessment of disordered voices is proposed. A feature named mean opening quotient (MOQ) obtained from the glottal source estimation is used as an acoustic cue to summarize the degree of severity of the voice disorder. The analysis method uses the empirical mode decomposition algorithm to estimate the glottal source excitation signal from the speech signal. The l...
متن کاملPathological Voice Analysis and Classification Based on Empirical Mode Decomposition
Empirical mode decomposition (EMD) is an algorithm for signal analysis recently introduced by Huang. It is a completely datadriven non-linear method for the decomposition of a signal into AM FM components. In this paper two new EMD-based methods for the analysis and classification of pathological voices are presented. They are applied to speech signals corresponding to real and simulated sustai...
متن کاملA Fault Diagnosis Method for Automaton based on Morphological Component Analysis and Ensemble Empirical Mode Decomposition
In the fault diagnosis of automaton, the vibration signal presents non-stationary and non-periodic, which make it difficult to extract the fault features. To solve this problem, an automaton fault diagnosis method based on morphological component analysis (MCA) and ensemble empirical mode decomposition (EEMD) was proposed. Based on the advantages of the morphological component analysis method i...
متن کاملAssessment of Disordered Voices Using Empirical Mode Decomposition in the Log-Spectral Domain
Empirical mode decomposition (EMD) algorithm is proposed as an alternative to decompose the log of the magnitude spectrum of the speech signal into its harmonic, envelope and noise components and the harmonic-to-noise ratio is used to summarize the degree of disturbance in the speech signal. The empirical mode decomposition algorithm is a tool for the analysis of multi-component signals. The an...
متن کاملAssessment of vocal dysperiodicities in disordered speech based on empirical mode decomposition
in this paper, empirical mode decomposition (eMd) is proposed as an alternative method in the framework of acoustic analysis of disordered speech for the purpose of clinical evaluation of voice. the empirical mode decomposition algorithm decomposes adaptively a given signal into oscillation modes extracted from the signal itself. the proposed approach for objective assessment of vocal dysperiod...
متن کامل